Methods Inf Med 1993; 32(05): 388-395
DOI: 10.1055/s-0038-1634951
Original Article
Schattauer GmbH

Influence of “Outliers” on the Association between Laboratory Data and Histopathological Findings in Liver Biopsy

E. Krusinska
1   Department of Medical Informatics, Linköping University, Sweden
2   Department of Mathematics and Informatics, Conservatoire National des Arts et Métiers, France
3   Institute of Electrical Metrology, Technical University of Wroclaw, Poland
,
U. L. Mathiesen
4   Department of Internal Medicine, Linköping University Hospital, Sweden
6   Department of Internal Medicine, Oskarshamn Hospital, Sweden
,
L. Franzén
5   Department of Pathology, Linköping University Hospital, Sweden
,
G. Bodemar
4   Department of Internal Medicine, Linköping University Hospital, Sweden
,
O. Wigertz
1   Department of Medical Informatics, Linköping University, Sweden
› Author Affiliations
Further Information

Publication History

Publication Date:
08 February 2018 (online)

Preview

Abstract

Discriminant analysis techniques were used to predict the histopathological findings in liver biopsy specimens in asymptomatic patients with slightly to moderately raised routine liver tests. Moderate to severe fibrosis and/or inflammation were treated as indication for biopsy. Two methods were used to classify patients. One was the dichotomous discrimination between “biopsy necessary” or “biopsy not necessary” groups of patients. The other involved combining two discriminant functions trained separately for recognition of fibrosis or inflammation, and then combined to predict the biopsy necessity. Detection of outliers by standard techniques, directly available in the SPSS-X package, was performed before starting discrimination procedures. Both “sharp” assignment rules and continuous scoring rules were applied to the classification problem. The correct classification rate reached over 85% for the algorithms tested. In the majority of cases the classification was found to be “non-doubtful”. Elimination of outliers (especially by standardized residuals) improved the global correct classification rate, but only slightly improved assignment to the “biopsy necessary” group. Routine and complementary laboratory findings were found to be the most discriminating; answers to questionnaire and ultrasound examination were less important. Selection of the most diagnostic features based on “clean” data without outliers enabled us to find interesting medical associations, which were previously masked by extremely asymptomatic values outlying from the main body of the “biopsy necessary” group.